An empirical study on program failures of deep learning jobs

- Zhang, Ru; Xiao, Wencong; Zhang, Hongyu; Liu, Yu; Lin, Haoxiang; Yang, Mao